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Abstract 

We consider the discrete time threshold-two contact process on a random r-regular 
graph on n vertices. In this process, a vertex with at least two occupied neighbors at 
time t will be occupied at time t + 1 with probability p, and vacant otherwise. We use a 
suitable isoperimetric inequality to show that if r > 4 and p is close enough to 1 , then 
starting from all vertices occupied, there is a positive density of occupied vertices up to 
time exp(c(p)n) for some constant c(p) > 0. In the other direction, another appropriate 
isoperimetric inequality allows us to show that there is a decreasing function 62 (p) and 
a constant Cb(p) := 2/log(2/(l + p)) so that if the number of occupied vertices in the 
initial configuration is < €2{p)n, then with high probability all vertexs are vacant at 
time Co(p)logn. These two conclusions imply that the density of occupied vertexs in 
the quasi-stationary distribution (defined in the paper) is discontinuous at the critical 
probability p c E (0, 1). 
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1 Introduction 



Interacting particle systems are often formulated on the d- dimensional integer lattice Z d . 
See e.g. Lig85| or |Lig99| . However, if one is considering the spread of influenza in a town, 



infections occur not only between individuals who live close to each other, but also over long 
distances due to social contacts at school or at work. Because of this, one should consider 
how these stochastic spatial processes change when the regular lattice is replaced by the 
random graphs that have been used to model social networks. 

[DJ07] considers the contact process on a small world graph S. In the contact process, 
each vertex is either occupied or vacant. Occupied vertices become vacant at rate 1, while 
vacant vertices become occupied at rate A times the number of occupied neighbors. The 
small world random graph, which |DJ07j considers, is a modification of the d- dimensional 
torus 71 := (Z mod L) d in which each vertex has exactly one long-distance neighbor, where 
the long-distance neighbors are defined by a random pairing of the vertices of the torus. 

The contact process on the small world (or on any finite graph) cannot have a non- 
trivial stationary distribution, because it is a finite state Markov chain with an absorbing 
state. However, on the small world and many other graphs, there is a "quasi-stationary 
distribution" which persists for a long time. To explain the concept in quotes, we recall the 
situation for the contact process on the d- dimensional torus 71- Let C Z d denote the 
contact process on Z d starting with single occupied vertex at the origin and let 

A c := inf{A : P(^oo) > 0}, where fi^ := {C t ° + for all t}. 

Let Q C Z d denote the contact process on Z d starting with all vertices occupied. Mono- 
tonicity and self-duality imply that (see |Lig99|) if A > A c and := lim^oo^ 1 , where 



the limit is in distribution, then is a translation invariant stationary distribution with 

p(x g cl) = P(noo). 

Returning to the torus 71 and letting Q ' L C 71 denote the contact process on it starting 
from all vertices occupied, if A < A c , then there is a fci(A) > so that -P(C^miogn 7^ 0) ~ ^ 
as n — > oo, where n = L d is the number of vertices in 71. If A > A c , then with high 
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probability Q' L persists to time exp(/c2(A)n) for some /^(A) > 0. Furthermore, at times 
1 <^ t < exp(/c2(A)n) the finite dimensional distributions of Ct' 7 ^ are close to those of (see 
Lig99]). Thus the quasi-stationary distribution for the contact process on the finite graph 



is like the stationary distribution for the contact process on the associated infinite graph. 

Locally, the small world graph S looks like an infinite graph that is called the big world 
B in [L)J07J. In this graph, traversing a long range edge brings one to another copy of Z d . 
Sophisticates will recognize this as the free product Z d * {0, 1}, where the second factor is 
Z mod 2. Like the contact process on the homogeneous tree, the contact process on B has 
two phase transitions Ai < A2, which correspond to global and local survival respectively. 

t3 

That is, if Ct ' Q & denotes the contact process on B starting with single occupied vertex at 
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the origin, then 



Ai := inf{A : P(f2* ) > 0} and 

A 2 := inf | A : HminfF (o G C°' B ) > o} , 

where as earlier = {(t' B 7^ for all t}. Let denote the contact process on B starting 
with all vertices occupied. Monotonicity and duality imply that if A > Ai and := 

1 £3 1 w 

lim^oo ( t ' , where the limit is in distribution, then is a translation invariant stationary 
distribution with P(x G (If) = p (^). 

In order to study the persistence of the contact process (I' C S on the small world S, 
[DJ07] introduces births at a rate 7 from each vertex, which go from an occupied vertex to 
a randomly chosen vertex. With this modification it is shown that if A > Ai, then there is 
a constant k 3 = k 3 (\,^y) > so that for n = L d , persists to time exp(fc 3 n) with high 
probability. 

In this paper, we study the behavior of the discrete time threshold-two contact process on 
a random r-regular graph on n vertices. We construct our random graph G n on the vertex 
set V n := {1, 2, ... n} by assigning r "half-edges" to each of the vertices, and then pairing the 
half-edges at random. If r is odd, then n must be even so that the number of half-edges, rn, 
is even to have a valid degree sequence. Let P denote the distribution of G n . We condition 
on the event E n that the graph is simple, i.e. it does not contain a self-loop at any vertex, 
or more than one edge between two vertices. It can be shown (see e.g. Corollary 9.7 on page 
239 of [JL ROO] ) that f(E n ) is bounded away from 0, and hence for large enough n, 

if P := F(-\E n ), then P(-) < cP(-) for some constant c = c(r) > 0. (1.1) 

So the conditioning on the event E n will not have much effect on the distribution of G n . Since 
the resulting graph remains the same under any permutation of the half-edges corresponding 
to any vertex, the distribution of G n under P is uniform over the collection of all undirected 
r-regular graphs on the vertex set V n . We choose G n according to the distribution P on 
simple graphs, and once chosen the graph remains fixed through time. 
We write x ~ y to mean that a; is a neighbor of y, and let 

M y := {x G V n : x ~ y} (1.2) 

be the set of neighbors of y. The distribution Pg„. p of the (discrete time) threshold-two 
contact process £ t C V n with parameter p conditioned on G n can be described as follows: 

P Gn , P G 6+1 I Wx n 61 > 2) =p and 

P Gn , P (xe^t+i I \Kn£t\ <2) = o, 

where the decisions for different vertices at time t + 1 are taken independently. If P p denotes 
the distribution of the threshold-two contact process £t on the random graph G n having 
distribution P, then 

P P (-)=EP G „, P (-), 
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where E is the expectation corresponding to the probability distribution P. 

Let (/ C 14 denote the threshold-two contact process starting from £g = A, and let £ t x 
denote the special case when A = V n . In the long history of the contact process the first step 
was to study whether the critical value of the parameter lies in the interior of the parameter- 
space or not. Based on results for the threshold contact process on random directed graph 
in |CD] , and basic contact process on the small world S in [DJ07] , it is natural to expect 
the existence of a critical value p c G (0, 1) defining the boundary between rapid convergence 
within logarithmically small time to all-zero configuration for p < p c , and exponentially 
prolonged persistence of changes for p > p c . We define the boundary p c between convergence 
to the all-zero configuration within time C(p) logn, and exponentially prolonged persistence 
as 

p c := inf \ p G [0, 1] : lim P„ ( inf > u(p) J = 1 for some k(p), u(p) > X . (1.3) 

7i~s>oo \t<cxp(k(p)n) Tl J J 

In order to show that p c < 1, it suffices to show that if p is sufficiently close to 1, then 
maintains a positive fraction of occupied vertices for time > exp(cin) for some constant 
ci > 0. 

Theorem 1. If r > 4 and i] G (0, 1/4), then there is an e\ = 61(77) G (0, 1) such that for 

1-ei 



and for some positive constants C\ and c\(r),p), 



<p<l, (1.4) 



P P ( mf ^-<l-e 1 )<C 1 exp(-c 1 (r t , P )n) 



t<exp(ci(r?,p)n) n 

In words, if p is sufficiently close to 1 and r is larger than 3, then the fraction of occupied 
vertices in the threshold-two contact process starting from all-one configuration remains close 
to 1 for exponentially long time with probability 1 — o(l). Here and later o(l) denotes a 
quantity that goes to as n goes to 00. So Theorem [T] confirms that p c < 1 for r > 4. The 
argument does not work for r = 3, as the lower bound in (11.41) is higher than 1 if we put 
r = 3. We believe that similar result holds for r = 3, but the problem remains open. The 
key to the proof of Theorem [1] is an 'isoperimetric inequality' (see Proposition [2] below). 

Next we study the behavior of when \A\ is small. 

Theorem 2. There is a decreasing continuous function €2 '■ (0, 1) h-> (0, 1) and a collection Q 
of simple r -regular graphs onn vertices such that for any p G (0,1), Co(p) := 2/ log(2/(l+p)), 
and any subset A C V n with \A\ < €2(p)n, 

(ii) P(^ c ) =o(l)- 
ifence lim^ P p (^ o(p)logn1 ^ 0) = 0. 
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In words, for any value of p G (0, 1), whenever the fraction of occupied vertices drops 
below a certain level depending on p, all vertices of G n become vacant within logarithmically 
small time with probability 1 — o(l). Thus the density of occupied vertices doesn't stay in the 
interval (0, e 2 (p)) for long time. The key to the proof of Theorem |2] is another 'isoperimetric 
inequality' (see Proposition [TJ below). As a consequence of Theorem [2} we have: 

Corollary 1. There is a p Q G (0, 2/3) such that for < p < po, 

lim P p (£p(c ( P )+i)iogni 7^ 0) = 0> where C (p) is as in Theorem^ 

That is, if p is sufficiently close to 0, then starting from all-one configuration all vertices of 
G n become vacant within logarithmically small time with probability 1 — o(l). So Corollary 
[U confirms that p c > 0. 

Theorem [1] shows that p c < 1, and so for p G (p c , 1) the fraction of occupied vertices in 
the graph G n is bounded away from zero for a time longer than exp(rz 1//2 ). So we can now 
define a quasi-stationary measure which is an analogue of the upper invariant measure, 
as follows. For any A C V n , ^{B : B n A ^ 0} := Pp(£f exp(n i /2)1 H A ^ 0). Let X n be 
uniformly distributed on V n , and let 



Pn := :I„65} = - 



n 



fexpfn 1 / 2 )! 



So p n is the quasi-stationary density of occupied vertices in the threshold-two contact process 
on the random graph G n . Note that p n is an analogue of the density of occupied vertices 
in the upper invariant measure for the contact process with sexual reproduction on regular 
lattices, which is conjectured to have a continuous phase transition (see Conjecture 1 and 
heuristic argument following that in |DN94j ). As we now explain, things are different in the 
threshold-two contact process on a random regular graph. 

First observe that if p > p c , then p n is bounded away from zero with high probability, 
because if p n < e 2 (p), where e 2 (-) is as in Theorem[2j then |£r exp ( n i/2)-| | < ne 2 (p). In that case, 

for a = [exp(n x / 2 )] + [Co log n], either ^ 0, which has P p -probability o(l) by Theorem 
El or £l = 0, which has P p -probability o(l) by the definition of p c in (11. 3p and the fact that 
p > p c . Therefore, for p > p c , p n > e 2 (p) with P p -probability 1 — o(l). 

Next observe that for any pi,p 2 ^ [0,1] with p 1 < p 2 , the random variables Z; L ~ 
Bernoulli(pi),i = 1,2, can be coupled so that Z\ < Z 2 - Using this coupling for all the 
Bernoulli random variables, which are used in deciding whether x G £t for x G V n , t = 1,2,..., 
it is easy to see that 

Pa n , Pl < Pc n , P2 , i-e. for any increasing event B,P GntPl (B) < P GniP2 (B). 

The same inequality holds for the unconditional probability distributions P Pl and P P2 . Since 
{pn > e} = {l^ cxp ( n i/2)^ I > en } is an increasing event, it follows that for any p > p' > p c 

P P {Pn > e 2 (p')) > P p ,{p n > e 2 (p')) = 1 - o(l) 
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by the above discussion. Taking p' sufficiently close to p c and noting that e^-) is a decreasing 
continuous function, we get the result of this paper that the threshold-two contact process 
on the random graph G n has a discontinuous phase transition at the critical value p c . 

Theorem 3. Let p := tzijPc), where €2(-) is as in Theorem^ and p c is as in (jl.3p . Then 
p > 0. For any p > p c and 5 > 0, 

lim P p (p n > p - 5) = 1. 

n—toc 

The key to the proof of Theorem [2] is an "isoperimetric inequality" . Given a subset 
U C V n , let 

U* 2 :— {y G V n : y ~ x and y ~ z for some x, z E U with a; 7^ 2}. (1.5) 

The idea behind this definition is that if U — £t for some i, then f/* 2 is the set of vertices 
which have a chance of being occupied at time t + 1. Note that U* 2 can contain vertices of 
U. 

Proposition 1. Let E(m,k) be the event that there is a subset U <zV n with size \U\ = m 
so that \U* 2 \ > k. Then there is an increasing positive function es(-) so that for any n > 
and m < 63 (r])n, 

( v 2 

P [E(m, (1 + r])m)] < C3 exp I — — m log(n/m) 

for some constant C3 = C%{r). 

In words, if U is a small set, then for any n > 0, \U* 2 \ < (l + n)\U\ with high probability. 
Now if EQ n<p is the expectation corresponding to the probability distribution -Pg ?1iP , then 
-Eg„, p (|6+i| |£t) = PlC 2 |- Given p e (0, 1), we can choose rj(p) > so that p(l + ?y(p)) < 
(1 +p)/2. So using Proposition [U if |&| is small, £; G „ )P (|6+i| 16) < +p)/2 with 

high probability. This observation together with large deviation results for the Binomial 
distribution implies that |£t+i| < |^|(1 +p)/2 with high probability if |^| is small. Finally if 
the number of occupied vertices reduces by a fraction at each time, all vertices will be vacant 
by time O(logn) and so Theorem [2] follows. 

The key to the proof of Theorem [1] is another 'isoperimetric inequality'. If W = V n \ £t 
is the set of vacant vertices at time t, then (W c )* 2 is the set of vertices which have a chance 
of being occupied at time t+1, and so ((W c )* 2 ) c is the set of vertices which will surely be 
vacant at time t + 1. 

Proposition 2. Let F(m,k) be the event that there is a subset W C V n with \W\ = m so 
that \((W C )* 2 ) C \ > k. Given 77 > 0, there are positive constants e 4 (r, 77) and C 4 (r) so that for 
m < e^n, 

3 \ \ 1 

< C 4 exp(— (77/8)771 log(n/m)). 



771, — r + 71 771 

1 2(r - 2) ; ' 
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In words, if W is a small set, then for any 77 > 0, |((W /C )* 2 ) C | < (3/(2r — 4) + rf)\W\ 
with high probability. As noted above, Ea n;P (\£t+i\ |£t) — p\£t 2 \- For p as in (jl.4j) . we can 
choose ?y) > so that (p — S)(l — (3/ (2r — 4) + 77)61) > 1 — e\. So using Proposition [2] 
with J¥ = K \ if |&|/n > 1 - ei, then E Gn>p {\£ t+1 \ |&) > p(l - (3/(2r - 4) + 77)61)71 > 
(1 — e\)np/(jp — 8) with high probability. This observation together with large deviation 
results for the Binomial distribution implies that |£t+i| < (1 — e\)n with exponentially small 
probability if > 1 — ei. Thus if r is the first time the fraction of occupied vertices 

drops below 1 — e%, then r > exp(ci(?7,p)ri) with high probability for a suitable choice of 
Ci(j],p), and so Theorem [1] follows. 

The remainder of the paper is organized as follows. In section [2} we present sketches of 
the proofs of Proposition [1] and [2j In section [3] and HI we use the propositions to study the 
behavior of & starting from a small occupied set and the fact that p c G (0, 1) respectively, 
while in section [6] and [7] we present the proofs of the propositions. Section [5] is about the 
first order phase transition at p c . Finally in section [H] we prove several probability estimates, 
which are needed in the proof of the propositions. 

2 Sketch of the proofs of the isoperimetric inequalities. 

Recall the definition of U* 2 from ( 11.5H . We need some more definitions and notations. For 
any vertex x G V n and subsets U, W C V n let dU be the boundary of the set U, U* 1 be 
the set of vertices which have at least one neighbor in U, e(U, W) be the number of edges 
between U and W. Also let Uq be the set of vertices in U which have all their neighbors in 
U c , and U\ be the complement of Uq. So 

dU := {y G U c : y ~ x for some x G U}, U* 1 := {y G V n : y ~ x for some x G U}, 

e(U,W) := \{{x,y) : x G U and y G W}\, 

U := {x G U : y ~ x implies y G U c }, U x := U H t/ Q c . 

(2.1) 

2.1 Isoperimetric inequality in Proposition [I] 

From the definitions in (12. ip it is easy to see that if \U\ — m, then 

rm> e (M' U ) ^ W* 1 \U* 2 \ + 2\U* 2 \ = {U*^ + \U* 2 \. 

So for any subset U of vertices of size m, 

if I U* 2 1 > k, then \U* X \ < rm - k. (2.2) 
In view of (12. 2p . for proving Proposition [T] it suffices to estimate the probability 

P [H(m, (r - 1 - 77)777)] , where H(m,k) = U {UcVn .. ]ul=m} {{U*^ < k] (2.3) 
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is the event that there is a subset U of vertices of size m with If/* 1 ! < k. 

Note that U* 1 is a disjoint union of dU and Lq. Our first step in estimating (12.31) . taken 
in Lemma 18.2} is to show the following. 

J. For \U\= m and any T] > 0, e(U, U c ) > (r — 2 — rf)\U\ with probability at least 
1 — exp (—(1 + T]/2)m log(n/m) + Aim) for some constant A±. 

Take a = (r—2—i])/r in Lemma IHT21 so that (1— a)r/2 = l+rj/2. We cannot hope to do better 
than r — 2. Consider a tree in which all vertices have degree r and let U be the set of vertices 
within distance k of a fixed vertex. If s = r — 1, then \U\ = l+r+rs + - • ■+rs k ~ 1 ~ rs k / (s — 1) 
and e(U, U c ) = rs k , so e(U, U C )/\U\ ws-l = r-2, 

In the next step, see Lemma [8.41 we show the following. 

II. Given e(U,U c ) = u\U\ for some constant u and rj > 0, if m — \U\ < e^{j])n, then 
\dU\ > {u — rj)\U\ with probability > 1 — exp (— i]mlog(n/m) + A 2 m) for some constant A 2 . 

Considering all possible values ofu>r — 2 — 77 and using / and If, 

\dU\ > (r — 2 — 277) I C/| with probability > 1 — 2 exp (—(1 + rj)m\og(n/m) + (Aj + A 2 )m) . 

Using the fact (see Lemma 18. ip that 

the number of subsets of V n of size m is at most exp (m log (n/m) + m), 

the expected number of subsets U of size m with \dU\ < (r — 2 — 2r])\U\ is exponentially 
small if m < e{rf)n for some small fraction e(^). Therefore, 

with high probability (f/* 1 ) > \dU\ > (r - 2 - 2r?) J CZ| , whenever |C/| < e(^)n. (2.4) 

But this is not good enough, so we need to work to improve the first inequality above. 

Recall the definitions of Uq and U\ from (12.11) . There are two possibilities based on \U\\. 
Given 77 > 0, if \U X \ < {r]/2r)\U\, then e(U, U c ) > r\U \ > (r - r]/2)\U\ . So using II, 

if \U\ = m, then \dU\ < (r — 1 — rj)\U\ and \U\\ < (r)/2r)\U\ with probability at most 
exp (—(1 + r]/2)m log(n/m) + A2m) . 

Combining with Z/7 the expected number of subsets of size m with the above property is 
exponentially small, if m < e{rf)n. Therefore, 

with high probability \U* l \ > \dU\ > (r — 1 — rj)\U\ whenever \U\\ < (r//2r)\U\. 

Next we look at the other possibility |Z7i| > (rj/2r)\U\. Using an argument similar to the 
one leading to (12. 4p . 

with high probability e(/7i,C/f) > {r—2—r])\U\\ whenever \U\ < e(r])n and \U\\ > (j]/2r)\U\. 

(2.5) 

Using the equalities e(U , U c ) = e(U , Uq) = r\U \ and e(Ui,U c ) = e(L r i,L r ( : ), we have 
e(U, U c ) = r\U \ +e(Ui, Uf). Combining this with another equality \U* l \ = \Ui\ + \dU\ and a 
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little algebra give that {\U* X \ < (r- 1 -ij)\U\} = {e(U, U c ) - \dU\ > (l + rj)\U Q \ +e(U u C/f) - 
(r — 2 — i])\Ui\} . In view of (I2.5p . the probability of the last events is estimated to be small 
enough (see ( 16. 14}) for details), so that using 1/7 the expected number of subsets U of size 
m with the above property is exponentially small. Combining the last two arguments, 

with high probability {U* 1 ] > (r — 1 — r])\U\ whenever \Ui\ > (rj/2r)\U\. 

This completes the argument to estimate the probability in (12.31) and thereby proves Propo- 
sition [TJ 



2.2 Isoperimetric inequality in Proposition [2] 

Recall the definition of M y from (11.21) . We need some more notations for Proposition [2j For 
any subset W of V n , let W° be the subset of vertices which are in W and have at most 
1 neighbor in W c , and W 1 be the subset of vertices which are in W c and have at most 1 
neighbor in W c . So 

W° := {y e W : \H y n W\ > r - 1}, /3 (W) := liy !/!^ 

W 1 := {y G W c : \J\f y n W\ > r - 1}, ^(W) := \W l \/\W\. (2.6) 

The idea behind these definitions is that if W c is occupied at tine t in the threshold-two 
contact process, then the subset of V n , which cannot be occupied at time t + 1, is 



((W C )* 2 ) C = W°UW\ and {{W C Y 2 ) C 



\w°\ + iw 1 ! 



By 7, e(W°, (W°) c ) > (r - 2 - (2r - A)r])\W°\ with high probability if \W\ < e(r])n. But 
e(W°, W c ) < \W°\ by the definition of W°. So if e(W°, (W°) c ) > (r - 2 - (2r - i)rj)\W°\, 



then 

e{W°, W\W°)= e(W , {W°) c ) - e{W , W c ) 
> (r - 2 - (2r - 4)r/)|iy | - 

Using e(W°, iy c ) < \W°\ again with Wo C W and the last inequality, we have 

e(W, W c ) = e(W \ W°, W c ) + e(W°, W c ) 

< r\W \ W°\ - e(W \ W°, W ) + \W°\ 
= [r- (2r - 4)(1 - r])/3 (W)]\W\. 

Each x G dW has e({x}, W) > 1 while each x e W 1 has e({x}, W) > r — 1. So using the 
previous result and the definition of 0i(W), 

\dW\ < e(W,W c ) - (r-2)|W 1 | 

< [r - (2r - 4)(1 - r])/3 (W) - (r - 2)^(W)]|W|. 
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Now if (2r - 4)(1 - 77)/3 (W) + (r - 2)/3i(W) > 2 + 77, then the above implies that 
|cW| < (r — 2 — 77)|W|, which has a small probability as mentioned earlier. From another 
viewpoint, 

(r - 2)\W 1 \ < e{W,W c ) - \8W\. (2.7) 

By II, if \W\ = m, then 

e(W,W c )-\dW\ < (1 + 2^)1 W| with probability > 1 -exp (-(1 + 277)771 log(n/m) + A 2 m) , 

and combining with 777 the expected number of subsets W with e(W, W c ) — \dW\ > (1 + 
277) |W| is exponentially small if \W\ < e(j])n. Therefore, 

with high probability e(W, W°) — \dW\ < (1 + 2r/)|W| whenever \W\ < e(rf)n. 

From (12771) . if e(W, W^ c ) - \dW\ < (1 + 2r/)|iy|, then /^(W) < (1 + 2rj)/(r - 2). 

Combining the last two observations, and noting that the maximum value of /3q + Pi 
under the constraints (i) 2(1 - 77)^0 + Pi < (2 + rj)/(r - 2) and (ii) f} x < (1 + 2r/)/(r - 2) is 
achieved when both constraints are equalities, we see that with high probability 

1 1 + 277 3 2 3 

2(r^2) + 7^2" - 2(r^2) + 7^2^ ~ 2(r^2) + ^ 

for r > 4, and Proposition [2] is established. 



3 Behavior of £ t starting from a small occupied set 

In this section, we will use Proposition [TJ to prove Theorem [2j 

Proof of Theorem^ If p e (0, 1), we can choose 77 > so that (p + f])(l + i]) equals any value 
between p and 1. To fix idea, we want to choose 77 > so that (p + r])(l + r]) = (l+p)/2. The 
roots of the quadratic equation r/ 2 + (1 + p)rj + p = (l+p)/2 are r]± = (— (1+p) ±y3 + p 2 )/2. 
Clearly t?_ < 0. Since p G (0, 1), (1 + p) 2 < 3 + p 2 , which implies (1 + p) < ^3 + p 2 and so 
r] + > 0. We choose 

77 = v (p) := ^ 3 + p2 ~ ( 1 _±_g) > Q so that (p + 77) (1 + 77) = < 1. (3.1) 

Next we take e 2 (p) := e 3 (77(p)), where e 3 (-) is as in Proposition [1] and ?7(-) is as in (13. ip . Since 
e 3 (-) and T)(-) are continuous, so is e%{-). Also note that e 3 (-) is increasing by Proposition [U 
and 

dr l P 1 / n / / o , 2 

— — = , < 0, as p < a/3 + V , 

5p 2v^HV 2 

which implies that 7/(-) is decreasing. Combining these two observations, e 2 (-) is decreasing. 
Having chosen e 2 , let 

G := n^ )nJ E^, where E m = E(m,(l + 77)777) is the event defined in Proposition [TJ (3.2) 
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The argument for (i) consists of two steps. 

Step 1: In the first step we show that for suitable choices of Cqi > and b G (0,1), if 
\A\ < t2^, then the number of occupied vertices in the threshold-two contact process £f 
reduces to n b within time Coi log n. The argument of this step goes through for any choice 
of b G (0, 1). But for future benefits we will choose b with the following desirable property. 
First note that using the inequality (1+p) < a/3 + p 2 , 

a/3 + p 2 3+p 2 (1 + pf + 2(1 - p) , . , . v 1-p 

— < — ; r = ; : , Which implies 77 < . 

2 2(1 + p) 2(1 +p) P ' 1+p 

By the last inequality, 

1 + , < so that -M±l. < 1 and |°f ^ + P \\ = + "> 6 (0, 1). (3.3) 

1+p' log 2/ 1+p log(2/(l+p)) + log(l+p) 



The assertion in (13. 3 j) suggests that we can choose 



6 = 6(p) g (0, 1) small enough, so that b + (b + 1) , ^ ,! NN < 1 and 6 < r/ 2 /16r. (3.4) 

log(2/(l+pj) 

Having chosen 6, let A be any subset of vertices with \A\ < e^n, and define 
v := min {t : < n b ) , 

4 := (le^l < |^i|j,JV t :=nUi^fort>l,JVo:={|^|<e 2 7i}, 



L t := | at most (p + 77) (1 + rj) many vertices of ((.f)* 2 are occupied at time t + l| . 
Now if L t occur, then by the choice of 77, 

|£t A + i| < (P + ijXl + ij) = fi-^j \g\ • So J t+1 D Lt. (3.5) 

By the definition of (^)* 2 , each vertex of (£^)* 2 will be in ^ 1 with probability p, and for 
G n G Q, |(£^)* 2 | < (1 + ??)|^ A | on the event N t . So using the binomial large deviations, 
see Lemma 2.3.3 on page 40 in |Dur07] . and the stochastic monotonicity property of the 
Binomial distribution, 

P Gn , p {L c t nN t \if) < P(Binomial((l + V )\if\,p)>(p + V )(l + r 1 )\^\) 

< exp(-F((p + v)/pW + v)\tf\), (3-6) 

where T(x) = xlogx — x + 1 > for x ^ 1. Since > n b on {t < v}, we can replace 
in the right side of (13.61) by n b to have 

P GniP (L° HN t n{t< u}) < P GntP (L c t HN t n > n b }) < exp(-r((p + r 1 )/p)pn b ). (3.7) 
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Combining (13. 5p and (13. 7\i we get 

P Gn A Jt+i n N t n {t < u}) < p GniP (L c t nN t n{t<v}) 

< exp(-T((p + rj)/p) pn b ). (3.8) 

We choose 

C 01 (p):=(l-b(p))/log(2/(l+p)) to satisfy n = n fe , (3.9) 



so that N lCoilogn] C {|ef Coilognl l < [(1 +p)/2f^ n \A\ < n b }. Hence {u > [C 01 lognl} C 
Nr Coilogn y Therefore, recalling the definition of N t and noting that N£ is the disjoint union 

u* =1 V s c niV s -i), 

P Gn , P {v> [Coilogn]) = P Gn , P {{v>\C 01 \ogn]}nNf Coilogn] ) 

< P Gn , P [uS 1 lognl ( r t n iv t _i n {i/ > t - 1})" 

[Coi log n] 

< ^ P GniP (J, c niV t _ 1 n{z/>t-i}). 
t=i 

Using (13.81) we can bound the summands of the above sum, and have 

PG n , P ( u> [Coilogn]) < [C i log n] exp(-T((p + r])/p)pn b ) < exp(-T({p + rj)/p)pn b /2) 

(3.10) 

for large enough n. 

Step2: Our next goal is to show that starting from any subset B of size \B\ < n b , the 
threshold-two contact process dies out within time C02 log n for a suitable choice of 
Cq2 > 0. Note that we always have |^x| < |(£t B )* 2 |- I* 1 addition, for G n G G we have 
|(£^)* 2 | < (1 + r i)\^f\ only when \ < €2(p)n. Keeping this in mind, we recall the choice of 
b from ( 13. 41) and choose 

CM ■= (6+l)/log(2/(l+p)) to satisfy 6 + C 02 log(l + r/) < 1, (3.11) 

so that for G n £ G and t < C02 log n and large enough n, 

|Cf I <(! + »?) |& B i| < • • • < (1 + < (1 + < n b +^2iog(i+r,) < £2 ^ n 

Now if F t = a{£? :0<s<t}, then 



E c n , P ( |^i I I ?t) =P feT* » and so ( 3 - 12 ) 
for t < C02 logn and G n G G, E Gn , p ( |^| | T t ) < p(l + r?)|£f |. 

Iterating the above inequality, 

(lefctaio.nl I) < b(! + ^)] Co2logn i^l for G n G 0. 
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Now by the choices of 77 in (13. 3p . p(l + r/) < (1 + p)/2, and by the choice of C02 in (13. lip . 

[(l + p)/2] Co2l °s n = n-( 1+fe ). So 



1 + 



C02 log n 



2 / 



ip(i+^^»iefi< 

Combining the last two inequalities, 

^ P (kfc f)2 io g nil)<^for G n eG 
Finally using Markov inequality, 



n b = 1/n. 



^G n ,P (\£fc 2 logn] 



>l)<^(|efc 02 io g nil)<-for G n eg 



Combining with ( 13 . 1 Of) . and using the Markov property of the threshold-two contact process 
under the probability distribution Pc n ,p, we get the result in (i) for C (p) := C i (p) + C 02 (p) , 
where C i is as in (I3.9P and C 2 is as in (13. lip . 

To show (ii) we use Proposition [1] and the fact from (11.11) that P(-) < cP(-) to have 



\e 2 (p)n\ 

Hq c )< Yl HE m )<cc 3 



m=l 



[e2(p)n\ / 2 \ \n b ~\-l , 2 

El V ■, n \ sr^ 1 V 1 n 

ex P -^ ml °S — + > , exp -— mlog — 

V or ml ' V 8r m 

m=rn i 'l V 7 "1=1 v 



Noting that the function <f)(i]) = r]log(l/r]) is increasing for rj e (0, 1/e) (see ( 18. 2ft ) and 
recalling that ^(p) < 1/e by its definition, mlog(n/m) = n<p(m/n) is an increasing function 
of m for m < €2(p)n. So we can bound the summands of the last display by the first terms 
of the respective sums to have 



P(£ c ) < cC 3 



{ n — n ) exp 



n log(n/n ) 1 +n exp(— (rj /8r)logn) 



as b < r] 2 /16r by our choice in (13 .4p . 



□ 



4 The critical value p c 

In this section, we show that the critical value p c is in the interval (0, 1). The fact that p c > 
follows as a consequence of Theorem [2j 

Proof of CorollaryUi If % := a{^ s : < s < t}, then, as observed in (l3~T2l . E GntP {\& +1 \ \U t ) = 
p\(£,t)* 2 \ — n P- So using Markov inequality, 

if K t := > 3np/2}, then P Gn , p {K t+1 \U t ) < ~. 



13 



Using properties of the conditional expectation, 



so that EG n , p l n t+i Ks < ^EQ ni pl n t_ iKi> . Iterating the last inequality, 

Pc^i^T^s) < (2/3) LlognJ < (3/2)n~ log ( 3 / 2 ). (4.1) 

Now since e 2 '■ (0, 1) i-> (0, 1) is decreasing and continuous, by intermediate value theorem 
there is a unique p Q G (0,2/3) such that e 2 (p ) — 3po/2 and for p G [0,p ), e 2 (p) > 3p/2. 
So if p G [0,p ), then (14.1 p suggests that drops below e 2 (p) for some s < logn with 

Po n ^-probability > 1 — (3/2)n~ log ^ 3 ^ 2 K Combining this with (i) of Theorem [2], noting that 
[lognj + \Cq(p) logn] < \(Co(p) + 1) logn] , and using Markov property of Pg„,p, we have 

sup P Gn , P (£f(Co(p)+i)iognl ^ 0) = o(l) for p G [0,po) and G n G Q. 

G n dQ 

This together with (ii) of Theorem [2] proves the desired result. □ 

Now we show that p c < 1 using Proposition [2j 

Proof of TheoremUi Given rj G (0, 1/4) let 64(77) be the constant in Proposition [2] and take 
€\ := €4. Since r > 4 and 77 < 1/4, 3/(2r — 4) < 3/4 < 1 — 77 so that the fraction in f ll.4p is 
< 1. For p between this fraction and 1, we can choose 5 = 5(rj,p) > such that 



{p-5)\l-[^— l +vje x j>l-e 1 . (4.2) 

For t = 0, 1, . . . if \$\ < [(1 - ei/2)nj, then let tf t = and if |#| > [(1 - £i/2)nj, we 
have too many vertices to use Proposition [21 so we let U t be the subset of Q consisting of 
[(1 — £i/2)nj many vertices with smallest indices. Thus \U£\ > ein/2 for any £ > 0. We 
begin with some notations. For t > let 

f| := {l&'l > (1 - d)"} , 0,:=nU/., 

== {K|a"-(2^4+")l C, « 
T t := I at least (p — 5) |t/ t * 2 | many vertices of U^ 2 are occupied at time t + 1} . 

On the event 5 t n T t , > (p - S)\U t * 2 \ > (p - 5)[n - (3/(2r - 4) + 77)|£y t c |], and on the 

event O t , \Q\ > (1 — ex)n so that |C/ t | = min{|£*|, [(1 — t\/2)n\} > (1 — t\)n and hence 
\Ut \ < e\n. Therefore, using (14. 2p it is easy to see that on the event S t H T t fl O t , 



I&i|>(p-*) ( 1 -( 2 7^4 + r ') e ^>( 1 



eijn. 
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So It+iHOt D S t nT t nO t for any t > 0. Next we see that if we take F t := F(\U?\, (3/(2r-4) + 
r])\U£\), where F(-, •) is defined in Proposition |2l then P<3 n , p (S' t |C/ t ) > l F c, since |(t/ t * 2 ) c | < 
(3/(2r — 4) + r])\Ut \ on the event St- Taking expectation with respect to the distribution of 
G n , PJSt\Ut) > P(-^ c )- As noted above, \U^\ < e\n on the event O t . So, recalling from (11. ip 
that P(-) < cP(-), we can apply Proposition |2] with m = \U^\ to have 

P p (StnO t \U t ) <P p (^n{|^ c | <e x n}\U t ) <cC 4 exp(-(r ? /8)|f/ t c |log(n/|^ c |)). 

Since t\ = e 4 < 1/e by (17.101) . combining the facts that the function 0(7/) = 7/log(l/^) 
is increasing on (0, 1/e) (see (I8.2p ) and \U£\ is always > ei77/2 by its definition, we have 
(/>(\Ujr\/n) > 0(ei/2) or equivalently \U^\ \og(n/\U%\) > (ei/2)nlog(2/e 1 ) on the event O t . 
Keeping this in mind, we can increase the upper bound in the last display to have 

P P (^ C H O t ) < P p (S{ n {ei77/2 < \U t c \ < em}) < cC 4 exp (-||l g(2/e 1 )7i) . (4.3) 

On the other hand, using the binomial large deviation, see Lemma 2.3.3 on page 40 in 
[Dur07j . 

P Gn , p {T t \Uf)>l-e W {-T{{p-5)/p)p\Uf\) 1 (4.4) 

where T(x) = x logx — x + 1 > for x ^ 1. As noted earlier in the proof, on the event Ot, 
\^}\ > (1 — ei)n so that \U t \ = min{|^|, [(1 — e i/2) n \} > (1 — e\)n. Therefore, on the event 
S t H O t , \U^ 2 \ > [1 — (3/(2r — 4) + r))ei]n. Keeping this in mind, we can replace \U^ 2 \ in the 
right hand side of (14.41) by [1 — (3/(2r — 4) + r))ei]n to have 

P Gn , p (T t c HS t n O t ) < P Gn , p (T t c n {\U; 2 \ > [1 - (3/(2r - 4) + r/) ei ]n}) 



< exp l-T((p-5)/p)p^l- - + eAn) . (4.5) 

The same bound also works for the unconditional probability distribution P p . Combining 
these two bounds of (14.31) and (14. 5p . and recalling that I t+ i fl O t D St H T t PI Ot, 

p p (/, c +1 no t ) < p p ((S t nT t ) c no t ) < p p (s;no t ) + p p (T t c ns t no t ) < dexpHM^n), 

where C\ = 2max{l,cC 4 } and 

d(v,P) = ^min|^log(2/e 1 ),r((p- ^1 - - - 

Hence for r = exp(ci(7/,p)n), we use the above estimate of P p (I£ +1 fl O t ) and the relation 
between O t and I t to have 



\Ug T \e t \ <(l-ei)«) = Pp(u t L : J ^ 



LtJ-i 



P p (/ t c +1 nO t ) < C'it exp(-2ci(77,p)n) = Ci exp(-ci (77, p)n) 



t=o 



and we get the desired result. □ 
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5 First order phase transition at p c 

In this section, we use Theorem CD and [2] to prove Theorem [3j 

Proof of Theorem^ First we estimate the probability P p (p n > e 2 (p)) for p G (p c , 1). Let 
o"i = [exp(n 1 / 2 )] and a 2 (p) = [C (p) log n] , where C (p) is as in Theorem [2j Depending on 
the fate of the process ^ at time o~\ + o 2 and whether G n G Q or not, where £ is defined in 
Theorem El we have 

P p ( Pn <e 2 (p)) = P p (\C 1 \<e 2 (p)n) 

< Pp(C 1+CT2 = 0) + EP GmjP (iej < e 2 (p)n , C 1+CT2 ^ 0) 

< P p (e i+CT2 = 0) + El gc + E [l e P Gm , P (|^J < e 2 (p)n , £ 1+CT2 ^ 0)] . (5.1) 

By the definition of p c in fll.3|) . the first term in the right side of ( 15. 1ft is o(l) for p G (p c , 1). 
By the estimate in (ii) of Theorem [2J the second term is also o(l). To bound the third term 
in (15.11) we use Markov property of Pg„, p and the estimate in (i) of Theorem [2] to have 

M'twClft I < ^(P) , C 1+CT2 ^ 0) = E P <w(£ = A)lgP Gn , p (C 2 ^ 0) 

A:| J 4|<e 2 (p)n 

< (i) y, p G m M x = A). 

A:\A\<e2{p)n 

Combining the last three observations, 

P P ( P n < e 2 (p)) < o(l) + o(l) + o(l) KP^A&x = A ) = (!)- 

A:|^|<ne 2 (p) 

Since p c < 1 by Theorem CD and e 2 (p) > for p G (0, 1) and e 2 (-) is a decreasing continuous 
function by Theorem [2], e 2 (p c ) > and for any 5 G (0, e 2 (p c )), there exists p' > p c such that 
e 2 (p') > e 2 (p c ) - 5. 

Therefore, using the fact that e 2 (-) is a decreasing function and the stochastic mono- 
tonicity of the probability distributions P p , p G [0,1], which is discussed in the introduction 
before Theorem [3j for any p G (p c , 1] 

P P (Pn > e 2 (p c ) - 5) > P p (p„ > e 2 (p')) 

> P P A P '(Pn > e 2 (pAp')) = 1 -o(l), 

where p Ap' = min{p, p'} > p c . So letting n — > oo the desired result follows. □ 

6 Proof of the first isoperimetric inequality 

In this section, we present the proof of the isoperimetric inequality in Proposition [TJ 
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Proof of Proposition^ In view of (I2.2p . it suffices to estimate the probability F[H(m, (r — 
1 — t])m), where H(m, k) = {3U G V n : \U\ = m, \U* l \ < k}. Recall the definitions of Uq and 
Ui from ( 12. II) . We need some more notations to proceed. Given n > define the following 
events for a subset U C V n . 

Au := {ll/xl > (v/2r)\U\}, B v := {\U* l \ < (r - 1 - r,)\U\}, ( , 

D u :={e(U,U c )<(r-2- V )\U\}. 

There are three steps in the proof. 

Step 1: Our first step is to estimate the probability that there is a subset U of vertices of 
size m for which By D A v occurs. On the event Ay, \Uo\ > (1 — rj/2r)\U\ and so e(U, U c ) > 
r\U Q \ > (r -7)/2)\U\. Also on the event B v , \dU\ < lU* 1 ] < (r - 1 From these two 

observations we have 

F^BuHAfj) < ¥({\dU\<(r-l-r ] )\U\}r\{e(U 1 U c )>(r-r ] /2)\U\}) 

< F(e(U,U c )-\dU\>(l + r]/2)\U\). (6.2) 

Combining (16.21) with the bound in (ii) of Lemma 18.41 

if \U\ = m < e 5 n, then ¥(B n D A v ) < exp [-(1 + r)/2)mlog(n/m) + A 2 m] . (6.3) 

Suppose 

Fi := U{ UGVn :\u\=m} (Bu n A v ) . 
Using (16.31) and the inequality in Lemma 18. H 

if m < e^n, then F(Fi) < [ ) exp [—(1 + i]/2)m\og(n/m) + A2771] 

\m J 

< exp [— (77/2)777. log(n/m) + (1 + A 2 )m] . (6.4) 

If m is small enough, then the above estimate is exponentially small, and so with high 
probability there is no subset U of size m for which By D A^ occurs. 

Step 2: Our next step is to estimate the probability that there is a subset U of vertices for 
which Ay occurs and e(Ui, Uf) < (r — 2 — r))\Ui\. If Ay occurs for some subset U of size m, 
then \Ui\ E [?7777/2r, m] . So we consider all possible subsets having size in that range, and let 

F 2 := ^{W:(f)/2r)m<\W\<m}Dw- 

Then using Lemma [8.21 with a = 1 — (2 + i])/r and the inequality in Lemma [8.11 

P(F 2 ) = P (U m / 6[W2r , m ] ^{W:\W\=m'} {<W, W°) < (r - 2 - r])m'}) 



m' \og{n /m!) + Aim' 



m'S [r)m/2r, m] 

< C 5 exp{-(r]/2)m'log{n/m') + (1 + A^m') . (6.5) 

ml £[r]m/2r,m] 
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Noting that the function <p(rj) = rjlog(l/r]) is increasing on (0,1/e) (see (18.21) ). if m < 
n/e, then for m' G [r)m/2r,m], m'\og(n/m') > (r]m/2r) log(2r n/rjm). Using this inequal- 
ity and the fact that (r/ / 2r) \og(2r / r/) > 0, we can bound each summand in ( 16. 5ft by 
C*5 exp(— (i]/2)(r]/2r)m log(n/m) + (1 + Ai)m). As there are fewer than m terms in the 
sum over m' in ( I6.5p . if we use the inequality m < e m for m > 0, and 

ifm<n/e, then P(F 2 ) < C 5 exp {-{r)/2){r]/2r)m\og{n/m) + (2 + Ai)m) . (6.6) 

If m is small enough, then the right-hand side of (16.61) is exponentially small, and so with high 
probability there is no subset U of size m for which Ajj occurs and e(Ui, U^) < (r — 2 — i])\Ui\. 
Step 3: Our final step is to estimate the probability that there is a subset U of size m for 
which Bu occurs assuming i*\ and F 2 do not occur. Noting that U* 1 is a disjoint union of 
U\ and dU, and \U\ = \U \ + \Ui\, a little arithmetic gives 

\u* l \ =1^1 + \du\ 

=(r - 1 - r))\U\ + \dU\ - {r -2-rf)\U x \-{r -I- rj)\U \. 

Letting 

A{U) = \dU\ - (r - 2 - 77)|Z7l| - (r - 1 - ^)|Z7 |, (6.7) 

we see that if B\j occurs, then A(U) has to be negative. Also if \U\ = m, then by the 
definition of F u B v n Ff C B v H Ay, and on the event D F 2 C , > (??/2r)|£/| and so 
e(U\, Uf) > (r — 2 — r/)|[/i|. Combining these two observations, 

F(BunF?nF£) ^(BunAuCiFZ) <P({A(C7) <Q}n{e{U u U^) > (t- — 2 — »7)|C7i|}). (6.8) 

Now by the definitions of U and Z7i, 

e([/ 0) [7 c ) = e(C/ , f/ c ) = r|E7 | and e(U u U[) = e(U u U c ), so that 

e(U, U c ) = e(U , U c ) + e(U u U c ) = r\U \ + e(U u U[), (6.9) 

and a little algebra shows that {A(U) < 0} = {e(U, U c ) - \dU\ > (1 + r/)|f/ | + e(L7i, E/f) - 
{r-2-rj)\U x \}. Also e(U u Uf) < r\Ux\. So 

F({A(U) < 0} n {e([/ 1; [/[) > (r - 2 - r?)^!}) (6.10) 
= P (W^i, ^i c ) = (r - 2 - 7] + 7)|^|} n {e(C7, f/ c ) - > (1 + ^)|C/ | + 7 |^i|}) ■ 

76(0,2+^) 

Combining (16.81) and (I6.10p . and recalling that |£7x| G [r)m/2r,m], 
if we write R = r — 2 — rj, 

and if r(r/,k) := F(e(U u U<) = (R + ^U^, {U^ = k) and 

s(j, fc) := P (e(C7, U c ) - |<9C/| > (1 + t^I + 7^i| I e(U u U{) = (R + 7 )|t/i|, = fc) , 
then P(Bc n F x c n F 2 C ) = ^ r( 7 , fc) s( 7 , fc). (6.11) 

7G(0,2+r/) fce[r/m/2r,m] 
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In view of Mty . if L= (R + ^)k + r(m- k), then {e{JJ x ,U{) = (R + j)^} D {\Ui\ = k} = 
{e(U,U c ) = L}n {\Ux\ = k}. So 

s ( 7) k) = P(e(C7, t/ c ) - |0£/| > 7 £; + (1 + 77) (m - fc) | e(t7, f/ c ) = L, \U X \ = k). 

Since under the conditional distribution P(-|e(Z7, U c ) = L) all the size-L subsets of half- 
edges corresponding to U c are equally likely to be paired with those corresponding to U, the 
conditional distribution of e(U, U c ) — \dU\ given e(U, U c ) and \U\\ does not depend on \U\\. 
So we can drop the event {\U X \ = k} from the last display and use (i) of Lemma EH with rj 
replaced by [jk + (1 + 77) (m — k))/m to have 

s(7, k) < exp (— {7/c + (1 + r])(m — k)} log(n/m) + A 2 m) , when m < €571. (6-12) 

In order to estimate r( 7 , fc), we again use (16.91) and recall that R = (r — 2 — 77) to have 

r( 7 ,A;) = P(e(L r i, l^) = (-R + j)k, \U X \ = k) 

= P(e(C7, £/ c ) = (i2 + 7) A; + r(m - fc), \Ux \ = k) 
< F(e(U,U c ) =rm- (2 + 77-7)*;), 

Using Lemma [8.21 with a = 1 — (2 + 77 — rfk/rm, 

/ 2 + 77 - 7 \ 
r (7, fe) < C5 exp I log(n/m) + Aim J . (6.13) 

Combining (16. lip . (16. 121) and (16 .13j) . if m < 6577,, then 

h n n f 2 c ) 

r r /o 1 „ , „,\ 

+ (1 + rj)(m — k) ? \og{n/m) + (Aj + A 2 )m 



E E ^exp [-{( 2 + ^ + 7 )fc + (l+7 / )(m-A:)} 



< 

7S(0,2+?7) fce[??m/2r,r?i] 

Noting that there are fewer than rm terms in the sum over 7 and at most m terms in the 
sum over k, and using the inequality m 2 < e m for m > 0, the above is 

< C^rm 2 exp [—(1 + r]/2)m\og(n/m) + (Ai + A 2 )t7i] 

< C 5 r exp [-(1 + 77/2)m log(7i/m) + (1 + Ai + A 2 )m] . (6.14) 

Recalling the definition of H(m, (r — 1 — 77)771) and considering whether the events Fi, i — 1, 2, 
occur or not, 

P(#(m, (r - 1 - 77)771)) = P (u mul=my Bu) 

< P(Fi) + P(F 2 ) + P (u { ^ : | t ,| =m} (Sp n F x c n F 2 C )) 

< P(Fi)+P(F 2 )+ E P{B v nF?nF$). 

{U:\U\=m} 
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Combining (16. 4p . ( 16. 6 p and (I6.14p . and using the inequality in Lemma 18.11 to estimate the 
number of terms in the sum, if m < min{l/e, €5(rj)}n, then 

¥(H(m, (r — 1 — i])m)) 

< P(Fi) + P(F 2 ) + rMCferexp [-(1 + r]/2)m\og(n/m) + (1 + A x + A 2 )m] 

< P(Fi) + P(F 2 ) + C 5 rexp [-(r]/2)m\og(n/m) + (2 + A 1 + A 2 )m] 

< C 3 exp[-(r] 2 /Ar)m\og(n/m) + (2 + A 1 + A 2 )m], (6.15) 

where C3 = 3max{l, C57"}. To clean up the result to have the one given in Proposition [TJ 
choose e 3 such that 

(r/ 2 /4r) log(l/e' 3 )/2 = 2 + A 1 + A 2 , and 63(77) := min{l/e, e 5 { V ), 40?)}, (6-16) 

where e$ is defined in (18.81) . So for any m < e^n, the estimate in (I6.15P holds, and 

(r/ 2 /4r) log(n/m)/2 > (?7 2 /4r) log(l/e / 3 )/2 = 2 + A 1 + A 2 , 

which gives the desired estimate for the probability in (I2.3p . and thereby, in view of ( 12. 2p . 
provides the required bound for the probability in Proposition [TJ 

To finish the proof of Proposition [1] it remains to check that ez(-) is increasing. By the 
definition of e 5 (-) in (18. 8p and the properties of /?(-,■) in Lemma 15731 e 5 (-) is increasing. Also 
by the definition of e' 3 in (I6.16p . log(l/e' 3 (-)) is decreasing and hence e 3 (-) is increasing. Since 
minimum of increasing functions is still increasing, we conclude from (16.161) that es(-) is 
increasing. □ 



7 Proof of the second isoperimetric inequality 

In this section, we present the proof of the isoperimetric inequality in Proposition |2j 

Proof of Proposition^ Recall the definitions of W l and fli(W) from (12. 6p . We need some 
more notations to proceed. Given rj > 0, let 

Qw := bo(W) + h(W) > ^-^y +v\ , Rw := hl(W) > . 
We divide the argument into three steps. 

Step 1: Our first step is to estimate the probability that there is a subset W C V n of size 
m for which R w occurs. Since each x 6 dW has e({x},W) > 1 and each x G W 1 has 
e({x},W) > r - 1, 

e(W, W c ) > (r - l)!^ 1 ! + (\dW\ - \W l \) = (r - 2)|W 1 | + \dW\, (7.1) 
and so R w C {e(W, W c ) - \dW\ > (1 + 2r])\W\}. Therefore, using (ii) of LemmaETJ 

if \W\ = m< e 5 n, then F(R W ) < exp[-(l + 2rr)m\og(n / m) + A 2 m]. (7.2) 
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Now if 

Mi := U{w.\w\=m}Rw, 
then using ( 17.21) and the inequality in Lemma 18.11 

if m < e^n, then P(Mi) < f j exp[— (1 + 2r])mlog(n/m) + A 2 m] 

< exp[— 2r]mlog(n/m) + (1 + A 2 )m]. (7.3) 

If m is small enough, the above estimate is exponentially small, which implies that with high 
probability there is no subset W of size m for which Rw occurs. 

Step 2: Our next step is to estimate the probability that there is a subset W C V n for 
which Q w n R c w occurs and e(W°, (W°) c ) < (r - 2 - (2r - 4)^)|W°|. If Q w D i^- occurs for 
some subset of size m, then a little algebra shows that for r > 4, 



2(r-2) ' r-2 ~ 2(r-2) 7 

and so | | G [m/(2r — 4),m]. For this reason we consider all possible subsets having size 
in that range and let 

M 2 := U {f / : | C /|e[m/(2r-4)H}{ e ( C/ » U c )<{r-2- (2r - A) V )\U\}. 

Applying Lemma IH^2| using the inequality in Lemma I87TI and then using an argument similar 
to the one leading to (16. 6p . 

if m < n/e, then 

P(M 2 ) = P (U m , e[m/{2r _4),m] U { u:\u\=m>}{e{U,U e ) < (r-2- (2r-4)77)|Z7|}) 

n 



< £ Oexp 

m'e[m/(2) — 4),m] 



2 + gr - 4)^ / 
m log(n/m J + Aim 



r ] 
< C5 exp — m log(n/m) + (2 + Ai)m . (7-4) 

If m is small enough, then the right hand side of ( 17.41) is exponentially small, and so with high 
probability there is no subset W of size m for which Qw H R w occurs, and e(W°, (W°) c ) < 
(r-2- (2r - A)rj)\W°\. 

Step 3: Our final step is to estimate the probability that there is a subset W C V n for 
which Qw occurs assuming Mi and M 2 do not occur. If \W\ — m, then by the definition of 
Mi, Qw H Afj; C Q w n i?^. On the event Q w n R w D M 2 C , |V^°| e [m/(2r - 4), m] and so 
e(W°, (W°) c ) > (r - 2 - (2r - 4)r/)|iy°|. Also by the definition of W , e(W°, iy c ) < 
Combining these three observations with the fact that W° C W, on the event Qv^nM^flM^, 

e{W°,W\W°) = e(W°,(W°) c ) -e(W°,W c ) 

> (r-2- (2r - A)r])\W°\ - \W°\ 

= (r-3- (2r-4>7)A,(W)|W|. (7.5) 
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Next we see that W is a disjoint union of W° and W \ W°, and {W \ W°) c is a disjoint union 
of W° and W c . So 

e(W,W c ) = e(W \ W°, W c ) + e(W°, W c ) 

= e{W\ W°,{W\W°) C ) -e{W\ W°,W°) + e{W°,W c ). (7.6) 

Combining the inequalities in (17.50 and (17.60 . recalling that e(VK \ W , (W 7 \ W /0 ) c ) < r\W \ 
W°\, and again using the inequality e(W°, W c ) < \W°\, we see that on the event Qw HMf D 
Ml 

e(W,W c ) < r\W\W°\ -e(W°,W\ W°) + e(W°,W c ) 
< [r - (2r -A)(l-r])/3 (W)}\W\. 

Therefore by (17. ip . 

\dW\ < e(W,W c ) - (r - 2)\W 1 \ 

< [r -(2r -A)(l-r])/3 (W) - (r - 2)f3 1 (W)}\W\. (7.7) 

Now we show that (2r-A)(l-r])/3 (W) + (r-2)/3 1 (W) > 2 +r] on the event Q w nMfnAf£. 
By the definition of M u f3 1 (W) < (1 + 2r/)/(r - 2) on the event Qvk n M{ n M 2 C . So if 
(2r — 4)(1 — ?7)/3 (W / ) + (r — 2)/3 i (W / ) < 2 + 77 on the same event, then noting that the 
maximum value of (3q + /3i under the constraints (i) (2r — 4)(1 — i])j3 + (r — 2)/3i < 2 + 77 
and (ii) /?i < (1 + 2rj)/{r — 2) is attained when both constraints hold with equality, a little 
algebra shows that 

1 + 277 1 3 2 3 

01 + & ~ T^2 + 2^~2) = 2^~2) + —2 V ~ 2^~2) + V 

on the event Qw H Mf D Mf. But the definition of Qvy contradicts that. So, on the event 
Q w n Aff n Mf, we must have (2r - 4)(1 - rj)Po(W) + (r - 2)ft(W) > 2 + 77 and hence 
< (r - 2 - r7)| W\ by dZTT]). Thus F(Q W n Mf n M 2 C ) < P(|9W| < (r - 2 - 77)] W|). In 
order to estimate the right-hand side of the last inequality we apply Lemma 18.51 to have 

if \W\=m< e 5 n, then P(Q w nM 1 c nM 2 c ) < C , 7 exp(-(l+r//4)mlog(n/m) + (l+A 1 + A 2 )m). 

(7.8) 

Recalling the definition of F(m, k) and considering whether the events Mi, i = 1,2, occur or 
not, 

P[F(m, [3/(2r - 4) + 77] m)] = P (u {W :\w\=m}Qw) 

< P(M0 + P(M 2 ) + P (u ww1=m} (Q w n M[ n M 2 C )) 

< p(Mi) + p(m 2 )+ HQwr\MtnM%). 

{W:\W\=m} 
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Combining the probability bounds in (17. 3p . (17. 4p and (17. 8p . using the inequality in Lemma 
18.11 to estimate the number of terms in the sum, if m < min{l/e, e^(r])}n, then 

F[F(m, [3/(2r-4) +rj]m)] 

< P(Mi) + P(M 2 ) + rjCVexp [-(1 + r]/A)m\og(n/m) + (1 + A x + A 2 )m] 

< P(M X )+P(M 2 ) + C 7 exp[-(ri/4)mlog{n/m) + (2 + A x + A 2 )m] 

< C 4 exp[-(^/4)mlog(ra/m) + (2 + Ai + A 2 )m], (7.9) 

where C4 = 3max{l,CV}. To clean up the result to have the one given in Proposition [21 
choose 64(77) such that 

(77/8)log(l/e' 4 ) = (2 + A 1 + A 2 ), and e 4 := min{l/e, 65(77), e^r?)}, (7.10) 

where e 5 is defined in (18.81) . So for any m < e 4 n, the estimate in (17. 9p holds, and 

(77/8) log(n/m) > (77/8) log(l/V 4 ) = (2 + A : + A 2 ), 

which gives the desired result. □ 

8 Probability estimates for e{U,U c ) and \dU\ 

We begin with a simple estimate for the number of subsets of V n of size m. 

Lemma 8.1. The number of subsets ofV n of size m is at most exp(m log(n/m) +m). 

Proof. The number of subsets of V n of size m is [2j . Using the inequalities n{n — 1) • • • {n — 

m + 1) < n m and e m > m m /m\, 

Tl \ Ti" 1 / 72.6 \ m 

< — < ( — ) = exp(mlog(r7 J /m) + m). 

□ 



ml ml \m J 



In order to study the distribution of \dU\, the first step is to estimate e(U, U c ). Because 
of the symmetries of our random graph G n , the distribution of e(U, U c ) under P depends on 
U only through \U\. 

Lemma 8.2. Let U be any subset ofV n with \U\ = m. Then for any a G (0, 1), 

F(e(U,U c ) <ar\U\) < C 5 exp (-^(1 - a)m log(n/m) + A x m 
for some constants C$ and A x . 
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Proof. Let /(it) be the number of ways of pairing u objects. Then 

it! 



' (u/2)\2 u / 2 ' 

If p(m, s) = ¥(e(U, U c ) = s), then we have 

frm\fr{n — m]\ , firm — s)f(r(n — m) — s) 
p(m,s)< 1 J )s\- JK 



s J \ s J fi^n) 

To see this, recall that we construct the graph G n by pairing the half-edges at random, which 
can be done in f(rn) many ways as there are rn half-edges. We can choose the left endpoints 
of the edges from U in ( r ™) many ways, the right endpoints from U c in ( r (" j m )) many ways, 
and pair them in s! many ways. The remaining {rm — s) many half-edges of U can be paired 
among themselves in f(rm — s) many ways. Similarly the remaining (r(n — m) — s) many 
half-edges of U c can be paired among themselves in /(r(n — m) — s) many ways. 

Write D = rn, k = rm and s = nk for 77 e [0, 1]. Combining the bounds of (6.3.4) and 
(6.3.5) of [Dur07j we get 



p( m ,. s )<c 6 ^g)'"(A) 



2\ Vk / u \ k(l-v)/2 / j rr \u\ (i?-(l+»?)*)/2 

D 



l-(l±^| (8.1) 



for some constant Cr. Now 



if 4>{rf) = r]log(l/r]), then <p'(ij) = —(1 + log//) and 4>"{rf) = — — . (8.2) 

So (/>(•) is a concave function and its derivative vanishes at 1/e. This shows that the function 
0(-) is maximized at 1/e, and hence (l/i/) 77 = e™' < e 1//e for 77 G [0, 1]. So (e 2 /r]) vk < C k for 
C = exp(2 + 1/e). If we ignore the last term of flS.ip . which is < 1, then we have 

I orm I 

¥(e(U, U c ) < arm) = P( m > s ) < C 6 (rm) 1/2 C rm (-) 



<C 6 r^ 2 m 3 / 2 C rm (- 
\ n 



{rj: ?7rmGN, r)<a} 
Ul\ r(l—a)m/2 



as there are at most rm terms in the sum and (m/n) 1 ^ 11 < (m/n) 1 ~ a for rj < a. The above 
bound is 

< Csexp ^— — (1 — a)m log(n/m) + rm log C + 3m/2^ , 

and we get the desired result with C5 = Cgr 3 / 2 and A x = rlogC + 3/2. □ 

Lemma 18.21 gives an upper bound for the probability that e(U, U c ) is small. Our next 
goal is to estimate the difference between e(U,U c ) and \dU\. In order to do that, first we 
need the following large deviation probability estimate. 
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Lemma 8.3. If T\,T2, . . . are independent random variables and Tj ~ Geometric(pi) with 
Pi = (n — i + l)/n, then for any u > and 77 G (0, iz) inere are positive constants A 2 and 
(3 = (3(u, 77) such that for large enough n and any m < (3n, 

P (T 1 +T 2 -\ h T L ( M _^) m j > wm) < exp [-77m log(n/m) + A 2 m] . 

Moreover, (3(u, 77) 4- as 7/ J, and /or /ixed 77, /3(«, 77) 75 a decreasing function of u. 

Proof. Let qi — 1 — Pi — (i — l)/n. Then for 9 < log(l/gi), 



E 



[e Wi ]=E»rf" leW =iZ C 



fe=i 



Let e = m/n, 6 > and ee e < 1/(77 — 77) so that E , e e,T * is finite for 7 = 1,2 
Using Markov inequality 

\[u—r])m\ 

P (Ti + ■ ■ ■ + T^ u _ v ) m ^ > um) < exp [— 9um] J J _Ee 



L(u — 77)771] . 



02} 



i=i 



Using e = m/n and the formula for Eexp(8Ti), a little arithmetic shows that the above is 

i=l 



< exp 



-Ouen 



qie a 



< 



exp 



L(u-7?)enJ 

1 1 1 - (7 - l)/n 

-Orjen + n • — log — 



n 



(7 - l)e e /n 



J.3) 



Since e 9 > 1, it can be verified that the function = log[(l —x)/(l — xe 9 )} is increasing, 
so that we can bound the Riemann sum for the function g(x) in (18. 3ft by the corresponding 
integral. Thus the above is 



< exp 



—Onen + n 



(u-ri)e 



log(l — x)dx 



(u-rj)e 



log(l — xe 6 )dx 



•4) 



To bound the last quantity we let 



h(9, u, 77, e) = #ne — 



log(l — x)da; — 



(<u-n)e 



log(l — xe e )dx 



Clearly h(0,u,r],e) = 0. We want to maximize h with respect to 9 keeping all the other 
parameters fixed. Changing the variables y = 1 — x and z — 1 — xe 61 , 

fa = 6*776— I / log y dy — e~ e / log z dz J 

\J l—(u-ri)e J 1 — (u— r?)ee" / 



^ne— (—(1 — (« — n)e) log(l — (u — n)e) 

+ e~ 9 (1 - (n - 7])ee e ) log (l - (77 - n)ee e )) . 



.5) 
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where to evaluate the integrals we recall (xlogx — x)' = logx. 

dh/dO = r]€ + e- e (1 - (it - 7])ee 9 ) log (1 - (u - ^ee 9 ) 

+ e~ 9 (u - rj)te 9 log (l - (u - r/)ee 9 ) - e~ e (l - (u - r/)ee 9 ) ~ ^ ee 

v ' v 1 — [u — T])ee y 

= r/e + (u — rj)e + e~ e log (l — (u — r/)ee e ^ = ue + e~ 9 log (l — (u — i])ee 9 ^j . 

dh/39 = implies exp(— uee 9 ) = 1 — {u — r])ee 9 . Letting 

[3 = (3(u,r]) be the unique positive number satisfying e~ u/3 = 1 — (u — r])(3, (8.6) 

dh/dO > if ee 9 < (3. ([HID suggests that (3 G (0, l/(u - 77)). So for fixed u,r], e < p{u,rj) 
and 9* := log(/3/e), 9* > with ee e * < l/(u — rj), and the function /i is maximized at 9*. 
Plugging the value of 9* in (18. 5p . 

h = rje log(/3/e) + (1 - (u - T])e) log(l - (u - rj)e) - Ul - (u - r/)(3) log(l - (u - r/)f3). 

Noting that the function 

(1 - 8) log(l - 8) . _ „,_, -5-log(l-5) rt , .... [-1 if 5^0 

^(5) := ^ '- satisfies t// 5 = ^ i > 0, and MS) -> <^ 

o o I if 5 — >■ 1 

[^(<5) - > -1 for 8, 8' G (0, 1), and so 

h = ?yelog(l/e) + r/e log (3 + (u — r))e[ip((u — rj)e) — ip((u — rj)(3)} 
> r/e \og{l/e) - c 2 {u,r])e, (8.7) 

where C2(u, rj) = u — rj + rj log(l//3(w, 7/)). 

To see that (3(u, rj) has the desired properties, note that if ip x (u, rj) := e~ ux — 1 + (u — rj)x, 
then for x > 0, dip x /du = —xe~ ux + x > and d(p x /drj = —x < 0. If we put x = f3(u,r]), 
use (I8.6p . and note that <p x (u, rj) < if and only if < x < /3(m, 77), then 

for w' > u,ipp( UlV )('u>',T}) > <pp(u,T]){ u ) v) = 0; ari d so we m ust have (3{u ,rj) < {3(u,r]), 
for 7/ < 7),ipp(u,rj)(u,v') > (Pi3(u,r))(u,v) = 0; an d so we must have [3(u,r]') < f3(u,rj). 

To ensure that {3(u, rj) 1 as 77 4- 0, see that if /3(w, 0) := lim^o /?(«, then using continuity 
of /3(u, rj) and f^6]l . exp(-u/3(w, 0)) = 1 - 0) and so /3(u, 0) = 0. 

Using the properties of (3{u,rj) we can show that az(u,p) is bounded above as 77 and it 
vary. From the inequality e~ y > 1 — y we have 1 — e _a: = j Q e~ y dy > j Q (1 —y)dy = x — x 2 /2 
for any x > 0. In view of (18. 6p . using the last inequality we see that 

U 2 (3 2 277 ( u 2 

l—(u—rf)/3 = e~ u/3 < 1— u(3-\ , which implies (3 > — - and so c 2 (m,77) < u— 77+77 log I — 

2 u \2r7 
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and limsup^o 02(11, rf) < u. In the other direction, /3(w, 77) — > 00 as r\ — )■ u, since for any 
Po > we can choose r] G (0,m) so that 1 — (u — r] )f3 > e~ u/3 ° (e.g. choose r] satisfying 
1 — (u — r]o)f3 = (1 + e~ uf3 °)/2) to make sure /3(u,rjo) > (3o- Thus 02(^,77) — > —00 as 77 — )■ w. 
From the behavior of 02(^,77) when 77 is close to and u , and noting that C2{u,r]) depends 
continuously on 77, 

Cq{u) := max{c 2 («, 77) : 77 G (o, w)} < 00. 

Next we recall that e{U,U c ) < r\U\ so that u G [0, r]. Since (3(u,r/) is decreasing in w, 
recalling the definitions of 02(^,77) and co(w) it is easy to see that for fixed 77, 02(^,77) is 
increasing in u, and hence so is Cq(u). Therefore, 

if A 2 := Co(r), then 02(^,77) < Co(m) < A 2 for any < 77 < w < r. 

Coming back to estimate h, we can convert (18.71) to 

h >77elog(l/e) - A 2 e. 

Plugging the bound on /i and e = m/n m (18. 4ft we get 

P (Ti H h TL( u _.^) m j > wm) < exp(-77mlog(72/m) + A 2 m). 

which completes the proof of Lemma 18.31 □ 

Now we use Lemma 18.31 to get an upper bound for the probability that the difference 
between e{U,U c ) and \dU\ is large. 

Lemma 8.4. If U is a subset of vertices of G n such that \U\ = m, then for any 77 > ; 
u G (77, r] and A 2 as in Lemma \8.3[ there is a constant e§ = 65(77) > such that for large 
enough n and m < e^n, 

(z) P ( \dU\ < (u — f])\U\ I e(U, U c ) = u\U\) < exp(-7/mlog(n/m) + A 2 m), 
(ii) F (e(U, U c ) - \dU\ > r]\U\) < exp(-7/mlog(n/m) + A 2 m). 

Proof. Since \U C \ = n — m, there are r(n — m) many half-edges corresponding to U c . In order 
to have e(U, U c ) = um, we need to choose um half-edges corresponding to U c and pair them 
with the same number of half-edges corresponding to U. Since the half-edges are paired 
randomly under the probability distribution P, all the subsets of half-edges corresponding to 
U c of size um are equally likely to be chosen under the conditional probability distribution 
F(-\e(U,U c ) = um). Noting that the subset of size um, which is obtained by choosing um 
objects one at a time from a set of size r(n — m) uniformly at random without replacement, 
has uniform distribution over all possible subsets of that size, we can assume that the half- 
edges corresponding to U c mentioned above are chosen one by one uniformly at random 
without replacement. 

Suppose Ri half-edges are chosen by the time i many distinct vertices are chosen. Let 
T[ — Rx — 1 and T[ = Ri — Ri-i for i > 2. Since each vertex has r half-edges, Ri+± < 1 + ri 
and e(U, U c ) < r\U\ so that u < r. A little arithmetic gives that for large enough n, 

n n — In— 1 n 

— < — < so that for m < — and 1 = 1, . . . , um, ri+l+rm < n. 

r 2 + r + 1 r l + r ru + r r 2 + r + 1 
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Combining these inequalities, after choosing the i distinct vertex the failure probability to 
choose the (i + l) th distinct vertex at any step is 

ri — i i 

< — r < — for i < um. 

r(n — m) — ri — 1 n 

Then, on the event {e(U, U c ) = um}, the T[ can be coupled with geometric random variables 
Ti with failure probability [i — l)/n so that T[ < Tj. So 

P(i?LMH > um\e{U, U c ) = um) = P (T[ + ■ ■ ■ + T[ (u _ v)mi > um\e(U, U c ) = um) 

u—f])m ^* UTTl) , 

when m < n/(l + r + r 2 ). If we let 

65(77) = min{l/(l + r + r 2 ), (3(r, 77/2)}, (8.8) 

where /3 is defined in Lemma 18.31 then for m < e^n we have the above inequality and can 
use the probability estimate of Lemma 18731 as f3(u,rj) > j3(r,rj). From those two inequalities 
we conclude that 

P (\dU\ <{u- rj)m \ e(U, U c ) = um) < F{R [{u _ v)m] > um\e{U, U c ) = um) 

< exp(— rjm \og(n/m) + A2m) 

for m < e^n, which completes the proof of (i). 

To prove (ii), recall that e(U, U c ) < rm. So based on e(U, U c ) we have 

¥(e{U,U c ) - \dU\ > rim) 
< ^(e(U, U c ) - \dU\ > rim, e(U, U c ) = um) 

uG(r],r]: iimSN 

^(e(U, U c ) - \dU\ > rjm\e(U, U c ) = um)F(e(U, U c ) = um). (8.9) 

If m < e^n, we can use (i) to bound the first terms of the summands in the right-hand side 
of flEU) and have 

F(e(U,U c ) - \dU\ > rim) 

< exp(— rjm log(n/m) + A 2 m) F(e(U,U c ) = um) 

< exp(— ?7mlog(n/m) + A 2 m). 

□ 

Lemma 18.41 gives an upper bound for the probability that the difference between \dU\ 
and e(U, U c ) is large. Now we use Lemma IH721 and [8741 to estimate the probability that \dU\ 
is smaller than (r — 2)\U\. 
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Lemma 8.5. Let U C V n be such that \U\ = m and rj > 0. For the constants A\ of Lemma 
\8.£\ e 5 and A 2 of Lemma 8.4, ifn is large enough and m < e^n, then 

¥{\dU\ < (r-2-rj)\U\) < C 7 exp[-(1 + ij/4)mlog(n/m) + (1 + Ax + A 2 )m] 

for some constant C7. 

Proof. First we estimate the probability P(|9Z7| = (r — 2 — r]')\U\) when 77' > 77. Noting that 
|<9f/| < e(tf, f/ c ) < r|[/| for any U C K, 

W(\dU\ = (r-2-rf)\U\) 
= £ mdU\ = (r-2-T]')\U\,e(U,U c ) = (r-2-ri' + ^)\U\). (8.10) 

76[0,2+rj'] 

For the summands with 7 > r//2, we write each summand as the product of two terms 

F(e(U,U c ) = (r-2-r/ + 7 )|[/|)P(|<9£/| = (r - 2 - rf)\U\\e(U, U c ) = (r - 2 - 7/ + 7 )|£/|). 

We can use Lemmas 18.21 to estimate the first term above. For the second term, note that 
by the definition of €5 in (18. 8 p and the properties of /?(-, •) in Lemma f8.3[ if 7 > T)'/2, then 
P(r — 2 — rj' + 7,7) > j3(r,r]/2) > e$. So if \U\ = m < e$n, we can use (i) of Lemma EH to 
estimate the second term in the last display, and have 



F{\dU\ 
<C*5 exp 



r-2-rj')\U\,e(U,U c ) = (r - 2 - rf + j)\U\) 
2 + i - 7 



m\og(n/m) + Aim 



exp(— 7mlog(n/m) + A 2 m). 



As there are fewer than rm terms in the sum over 7 and each term has the same upper 
bound Csexp(— (1 + r//2)m log(n/m) + (Ai + A 2 )m), noting that m < e m//2 for m > 0, 



7e[??72,2+ J ?'] 

< rCs exp 



?7 ')|C/|, e ([/,C/ c ) = (r-2-7 7 ' + 7)|t/|) 



mlog(n/m) + (1/2 + Ai + A 2 )m 



(8.1L 



Lemma 18.21 to have 



For the summands in ( 18. 10)) with 7 < rj'/2, we can ignore one of the two events and use 

(r-2-rf)\U\,e(U,U c ) = (r - 2 - 7/ + 7 )|[/|) (8.12) 



< 



£ p(|0tf| = 

7 e[0,r,72) 

P(e(f/,f/ c ) < (r 



77V2)|C/|)<C 5 exp 



(r-2-7 7 / + 7)|t/|) 
2 + 7772 



mlog(n/m) + Aim 
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Combining (18. lip and (18.131) . noting that there are at most rm terms in the sum over rj' 
below, and again using the inequality m < e m//2 for m > 0, 

F(\dU\ < (r - 2 - r])\U\) = mdU\ = (r - 2 - r]')\U\) 

T]'e[v,r-2] 

< (C 5 + C 5 r)exp(-(l + r ] '/A)m\og(n/m) + (l/2 + A 1 + A 2 )m) 

)7'G[r?,r-2] 

< rC${l + r) exp(— (1 + rj / '4)mlog(n / 'm) + (1 + A x + A 2 )m), 

and we get the desired result with CV = C$r(l + r). □ 

Acknowledgements. The author wishes to thank Rick Durrett for suggesting the problem 
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